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ABSTRACT 

Coronaviruses are enveloped, single- 
stranded RNA viruses that are distributed 
worldwide. They include transmissible 
gastroenteritis virus (TGEV), porcine epidemic 
diarrhea virus (PEDV), and the human 
coronaviruses severe acute respiratory syndrome 
coronavirus (SARS-CoV) and Middle East 
respiratory syndrome coronavirus (MERS-CoV), 
many of which seriously endanger human health 
and well-being. Only alphacoronaviruses and 
betacoronaviruses harbor nonstructural protein 1 
(nsp1), which performs multiple functions in 
inhibiting antiviral host responses. The role of the 
C terminus of betacoronavirus nsp1 in virulence 
has been characterized, but the location of the 
alphacoronavirus nsp1 region that is important for 
virulence remains unclear. Here, using TGEV 


nspl as a model to explore the function of this 
protein in alphacoronaviruses, we demonstrate 
that alphacoronavirus nspl inhibits host gene 
expression. Solving the crystal structure of full- 
length TGEV at 1.85 A resolution and conducting 
several biochemical analyses, we observed that a 
specific motif (amino acids 91-95) of 
alphacoronavirus nsp1 is a conserved region that 
inhibits host protein synthesis. Using a reverse- 
genetics system based on CRISPR/Cas9 
technology to construct a recombinant TGEV in 
which this specific nsp1 motif was altered, we 
found that this mutation does not affect virus 
replication in cell culture, but significantly 
reduces TGEV pathogenicity pigs. Taken together, 
our findings suggest that alphacoronavirus nsp1 is 
an essential virulence determinant, providing a 
potential paradigm for the development of a new 
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attenuated vaccine based on modified nsp1. 


Coronaviruses (CoVs) belong to a large 
family of enveloped, single-stranded RNA viruses 
that are serious threats to public health and have 
caused considerable damage (1-5). Depending on 
serotype, CoVs can be generally divided into 
alpha-, beta-, gamma-, and deltacoronavirus (a- 
CoV, B-CoV, y-CoV, and 6-CoV, respectively) 
genera (2). The a-CoVs include TGEV, PEDV, 
human coronavirus 229E (HCoV-229E), human 
coronavirus NL63 (HCoV-NL63), and feline 
infectious peritonitis virus (FIPV) (1). The B- 
CoVs include murine hepatitis virus (MHV), 
SARS-CoV, and MERS-CoV (1). 

CoV nonstructural protein 1 (nsp1) is the 
N-terminal component of the ppla polyprotein. 
Only a-CoVs and B-CoVs encode nspl, whereas 
y-CoVs and 6-CoVs lack this protein (6). Nsp1 
sequences identified using standard search tools, 
such as the BLAST, are divergent among different 
CoVs. The sizes of a-CoV nsp1 and B-CoV nsp1 
differ; the a-CoVs encode nsp1 proteins of ~9 
kDa, which are substantially smaller than the ~20 
kDa nspl proteins of B-CoVs. Although the 
sequence homologies among TGEV, PEDV, and 
SARS-CoV nspl proteins are low, the core 
structures share a relatively conserved domain (7- 
9). This high structural similarity may explain 
why CoV nspl has the conserved biological 
function of inhibiting host gene expression. 
However, the critical region of B-CoV nspl 
required to inhibit protein synthesis is different 
from that of a-CoV nsp1 (9-12). This difference 
could imply that the detailed mechanisms through 
which CoV nsp1 proteins inhibit host gene 
expression are unpredictable. To date, the ability 
of B-CoV nsp1 to inhibit host gene expression has 
been deeply studied. SARS-CoV nsp1 not only 
prevents mRNA translation but also promotes its 
degradation by binding to the 40S ribosomal 


subunit (13). Furthermore, MERS-CoV nspl 
selectively targets mRNA synthesized in the host 
cell nucleus for degradation and thus inhibits 
translation in host cells (14). TGEV and PEDV 
nspl proteins cannot bind the 40S ribosomal 
subunit to inhibit the translation of host mRNA; 
however, HCoV-229E and HCoV-NL63 nspl 
proteins might bind the 40S ribosomal subunit to 
affect host mRNA stability (9, 15, 16). 

At present, the specific pathway by which 
a-CoV nspl inhibits host gene expression is 
unknown. However, the function of CoV nsp1 in 
viral immune system evasion has been well 
characterized in cell culture. For example, SARS- 
CoV nsp1 suppresses interferon (IFN) expression 
and host antiviral signaling pathways in infected 
cells (10, 17). Furthermore, MHV nsp1 efficiently 
interferes with the type I IFN system (11), 
whereas PEDV nspl mediates CREB-binding 
protein (CBP) and nuclear factor-kB (NF-KB) 
degradation to inhibit type II IFN responses (12, 
18). Thus, nsp1 is considered a possible major 
virulence factor for CoVs. Moreover, the 
contribution of nspl to CoV pathogenesis has 
been directly demonstrated for MHV and SARS- 
CoV (11, 17). However, little direct evidence has 
indicated that a-CoV nsp1 is a virulence factor. 
Here, we present the crystal structure of full- 
length TGEV nspl. Further structural and 
biochemical analyses indicated that a motif 
(amino acids 91—95) is important for the inhibition 
of host gene expression by a-CoV nspl. In 
addition, we demonstrate that the loss of nsp1- 
induced inhibition of host protein synthesis does 
not affect the replication of TGEV but can 
significantly reduce its virulence in piglets. This 
research improves our understanding of why a- 
CoV nspl is necessary for virulence and may aid 
in the development of a new attenuated vaccine. 


Results 
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Transient a-CoV nspl expression affects cellular 
gene expression 

Using Renilla luciferase (Rluc) assays, we 
assessed several representative @-CoV nspl 
proteins for their ability to interfere with host cell 
gene expression. Human embryonic kidney 
(HEK-293T) cells were cotransfected with 
plasmids expressing the nspl protein and pRL- 
SV40 plasmids expressing SV40 promoter-driven 
Rluc. The results showed that TGEV, FIPV, 
HCoV-229E, HCoV-NL63 and PEDV nspl 
significantly reduced luciferase reporter gene 
expression. Furthermore, subsequent western blot 
analysis using equal amounts of intracellular 
proteins confirmed this finding (Figure 1A). 

Then, we used TGEV nsp1 as a model to 
study the detailed function of a-CoV nsp1. Using 
the SARS-CoV nsp1 protein as a positive control 
(13), we examined Rluc mRNA in HEK-293T 
cells using real-time quantitative PCR and found 
that TGEV nsp1 inhibited Rluc mRNA synthesis 
at different time points (Figure 1B). To further 
explore whether TGEV nsp1 could broadly inhibit 
gene expression in HEK-293T cells, we examined 
host proteins using a ribopuromycylation assay. 
The broad-spectrum inhibitory activity of TGEV 
nsp1 on host protein synthesis was dependent on 
both concentration and time (Figure 1C and D). 


Identification of the critical region of TGEV 
nspl required for inhibition of protein synthesis 
Although the structure of a truncated 
TGEV nsp1 protein (Protein Data Bank (PDB) 
accession number 3ZBD) has been reported (8), 
we wondered whether the C-terminus could affect 
the overall structure. We performed initial 
crystallization trials for the full-length TGEV 
nsp1 (residues Met1 to Arg109). Fortunately, we 
were able to determine the crystal structure of 
full-length TGEV nspl using the molecular 


replacement method and refined it to a 1.85-A 
resolution in space group P2,. The details of the 
phasing and refinement steps are given in Table 1. 
Although the crystal resolution was very high, the 
specific conformation of the C-terminus remained 
unclear. This was probably because the C-terminal 
loop was flexible, for which it is difficult to obtain 
accurate information. Structural characterization 
demonstrated that the structure of full-length 
TGEV nsp1 displayed eight B-sheets and two a- 
helices (Figure 2A), and the full-length protein 
shared a common skeleton with the truncated 
protein. The similarity of the three-dimensional 
conformations indicated that the C-terminus had 
little influence on the overall TGEV nspl 
structure. 

To identify which region in TGEV nsp1 
was essential for its inhibitory activity, we used 
methods that have been previously used to study 
important functional regions in PEDV nsp1 (9). 
Based on the crystal structure of TGEV nsp1, we 
designed the following series of plasmids in which 
random coils were replaced with flexible Ser-Gly- 
Ser-Gly (sg) linkers: nsp1(1—3sg), nsp1(11—15sg), 
nsp1(19—22sg), nsp1(37-40sg), nsp1(46—-48sg), 
nsp1(52—60sg), nsp1(66-69sg), nsp1(76—84sg), 
nsp1(91—95sg) and nsp1(105—109sg) (Figure 2B 
and C). The Rluc assay showed that the loops 
comprising amino acids 37—40 and 91-95 were 
the most important regions for the ability of 
TGEV nsp1 to inhibit reporter gene expression 
(Figure 2C). Subsequently, western blot analysis 
indicated that nsp1(91—95sg) expression was 
significantly higher than nsp1 expression (Figure 
2C), again demonstrating that nsp1 could inhibit 
its own expression (9, 15). We examined Rluc 
mRNA expression in HEK-293T cells by real- 
time quantitative PCR and found that TGEV 
nsp1(91—95sg) and TGEV nsp1(37—-40sg) did not 
inhibit Rluc mRNA synthesis at different time 
points (Figure 3A). We conducted the 
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ribopuromycylation assay and the results showed 
that TGEV nsp1(37-40sg) significantly reduced 
the inhibition of host gene expression (Figure 3B). 
Because the TGEV nsp1(37-40sg) protein was 
found to be insoluble in the Escherichia coli 
supernatant, the motif (amino acids 37-40) was 
not analyzed further. Based on the above 


experiments, we focused on TGEV nsp1(91—95sg). 


Finally, the 
demonstrated that the mutant protein no longer 


ribopuromycylation assay 


inhibited host protein synthesis in a dose- or time- 
dependent manner (Figure 3C and D). These data 
indicate that the motif comprising amino acids 
91-95 is critical for TGEV nspl-induced 
suppression of host gene expression. 


Structural similarity of wild-type and mutant 
TGEV nsp1 

To explore whether a loss of protein 
activity by TGEV nsp1(91—95sg) was due to 
structural changes, we constructed a TGEV 
nsp1(91—-95sg) plasmid. Expression and 
purification were performed as described for the 
full-length construct. This new construct yielded 
crystals under different crystallization conditions 
than those of the full-length protein, and native 
data were collected to 1.98 Å. The details of the 
phasing and refinement steps are given in Table 1. 
A comparison of surface electrostatics and shape 
between TGEV nspl and TGEV nsp1(91-95sg) 
led us to speculate that the sg linker mutation from 
amino acids 91 to 95 did not significantly change 
the overall structure of TGEV nsp1 (Figure 4A-F). 


The motif comprising amino acids 91-95 is 
important for the regulation of host translation 
by a-CoV nsp1 

We next determined whether this motif 
(amino acids 91—95) plays an important role in 
inhibiting host protein synthesis in the context of 
other a-CoV nspl proteins. To explore the 


corresponding motif in a-CoV nspl, we selected 
the previously mentioned representative 
sequences and aligned them using ClustalW2 
software. The o-CoV nspl amino acid sequence 
alignment showed that the motif comprising 
amino acids 91—95 was a relatively conserved 
domain (Figure 5A). Then, we constructed mutant 
nspl proteins from FIPV, HCoV-229E, HCoV- 
NL63, and PEDV to explore whether the a-CoV 
nspl motif comprising amino acids 91—95 was an 
important region for the suppression of host gene 
expression. Luciferase assays indicated that the 
important function of this motif (amino acids 91— 
95) in inhibiting host gene expression not only 
was exhibited by TGEV nsp1 but also could be 
extended to nspl proteins from other a-CoVs 
(Figure 5B). Further, western blot analysis 
revealed that the expression of the mutant proteins 
was higher than that of the wild-type proteins, 
especially for TGEV nsp1 and HCoV-NL63 nsp1 
(Figure 5C), consistent with the results of the 
luciferase assays. In addition, we compared the 
Rluc values between the mutant and the wild-type 
nsp1 proteins (Figure 5D). These data showed that 
the relevant motif (amino acids 91—95) might be 
responsible for inhibiting host gene expression, 
albeit to different degrees, among the a-CoV nsp1 
proteins. Consistent with our previous results (9), 
when we mutated the motif (amino acids 91—95) 
in PEDV nsp1, we again observed partial recovery 
of the ability to suppress Rluc expression. Taken 
together, these results indicated that the motif 
comprising amino acids 91—95 was an important 
conserved region for the regulation of host 
translation by a-CoV nsp1. 


Construction and recovery of a TGEV mutant 
virus 

Although a-CoV nspl expression is 
known to suppress host gene expression, its 
biological functions in viral replication have been 
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largely unexplored. To evaluate the role of TGEV 
nspl in inhibiting host gene expression during 
viral replication, we used our reverse genetics 
system to construct a recombinant TGEV 
encoding a mutant nspl protein in which open 
reading frame 3 (ORF3) was replaced by green 
fluorescent protein (GFP) (19). Based on the 
previously described results, we replaced the 
important motif (amino acids 91—95) in the nsp1- 
coding sequence. In the resulting mutant virus, the 
replicase gene start codon, translational reading 
frame, and residues required for proteolytic 
release of nspl from the replicase polyprotein 
were maintained (Figure 6A). The mutant virus 
was successfully amplified by transfecting 
corresponding bacterial artificial chromosome 
(BAC) plasmids into HEK-293T cells. Then, we 
collected the amplified virus 24 h after 
transfection to infect TGEV-susceptible cells. 
Characteristic green fluorescence consistent with 
the parental virus phenotype was observed 36 h 
after infection (Figure 6B). The recombinant virus, 
which was subsequently named TGEV(91—95sg), 
was verified by sequencing the corresponding 
replaced regions. To assess the stability of 
TGEV(91—95sg), we analyzed the nspl-coding 
region by RT-PCR sequencing after seven 
passages in PK15 cells, and no nucleotide changes 
were detected. Furthermore, western blot analysis 
confirmed that the virus successfully infected 
PK15 and swine testicle (ST) cells (Figure 6C). 


Growth properties of TGEV and TGEV(91-95sg) 

Following the successful recovery of 
TGEV(91-95sg), we next evaluated the role of 
nspl in inhibiting host gene expression during 
viral replication. The growth properties of 
TGEV(91—95sg) in various cell lines were 
measured by performing a multistep growth assay 
with a viral multiplicity of infection (MOD of 0.1. 
The results showed that both viruses reached a 


plateau at approximately 48 h post infection in 
PK15 and ST cells. Furthermore, the viral growth 
and peak titers of TGEV(91—95sg) in susceptible 
cells were indistinguishable from those of the 
wild-type virus, indicating that the nsp1 gene was 
not critical for viral growth in cell culture (Figure 
6D and E). To confirm this conclusion, we also 
tested the nspl expression level during viral 
infection and found that it was indistinguishable 
between the mutant and wild-type viruses (Figure 
6C). 


Pathogenicity of TGEV and TGEV(91-95sg) 

To test the pathogenicity of TGEV(91— 
95sg), an animal experiment was carried out. 
Piglets that had not been breastfed at birth were 
randomly divided into two groups with five 
piglets in each per group; in addition, a mock- 
infected control group was formed that contained 
three piglets. The piglets were orally inoculated at 
a dose of 1x10° 50% tissue culture infective doses 
(TCID;9) with the respective chimeric virus or 
mock-infected with Dulbecco’s modified Eagle’s 
medium (DMEM). The animal experiments 
showed that replacement of the selected motif 
(amino acids 91—95) reduced the pathogenic 
properties of TGEV. Furthermore, in the TGEV 
group, all five piglets exhibited obvious 
dehydration and weight loss. Severe diarrhea 
began at 48 h post infection, and all piglets died 
within 96 h, indicating the acquisition of lethal 
characteristics (Figure 7C). Interestingly, the 
clinical signs were significantly less severe in the 
TGEV(91—95sg) group than in the TGEV group 
(Figure 7A-C). To further assess the attenuation of 
TGEV(91-95sg), we performed postmortem 
examinations and hematoxylin and eosin (H&E) 
staining. In the TGEV group, the stomach was 
obviously inflated and had undigested flocculate. 
Moreover, the gastric fundus mucosa was 
congested, flushed, and bleeding. The stomach 
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wall and small intestinal wall became thin, which 
was especially true for the jejunum, and animals 
presented with intestinal dilation with an 
abundance of yellow liquid. In the TGEV(91— 
95sg) group, the stomach was slightly inflated, 
and the gastric fundus mucosa was not bleeding. 
The small intestinal wall was slightly thinner than 
that in the mock group. Overall, the damage to the 
small intestine and stomach in the TGEV group 
was significantly worse than that in the 
TGEV(91—95sg) group (Figure 7D). Moreover, 
H&E staining revealed extensive fine damage to 
different segments of the intestine, characterized 
by the fragmentation and shedding of intestinal 
villi. The results showed that the TGEV(91—95sg) 
group had only mild lesions in both the intestine 
and stomach compared to the TGEV group 
(Figure 7E). 

Next, we investigated how  nspl 
contributed to the virulence of TGEV by 
analyzing the viral infection efficiency, as 
reflected by virus shedding (Figure 8A), and viral 
tissue distribution (Figure 8B). Fecal samples 
were obtained with fecal swabs, and virus 
shedding was tested by regular PCR. The 
shedding of these groups was measured on days 1 
to 7 and was terminated because of the quick 
deaths of the infected piglets. At 7 days post 
inoculation, all surviving piglets were euthanized 
to reduce the stress of the other piglets. The 
results showed that the challenged piglets had 
viruses in their feces. In contrast, no detectable 
virus shedding was found in the mock-infected 
pigs. Furthermore, we performed sequencing of 
the corresponding replaced regions to confirm the 
stability of the 
Immunohistochemistry showed that the virus was 


infective virus. 


present in susceptible intestinal tissues, consistent 
with the clinical results. Collectively, these data 
demonstrate that TGEV(91—95sg) is strongly 


attenuated in vivo, and we conclude that the nsp1 
gene is necessary for virulence. 


Discussion 

Many viral proteins suppress host gene 
expression and modify host cell environments to 
promote virus-specific translation (20, 21). As a 
result of such processes, these proteins are often 
considered important virulence factors that 


contribute to the design of live attenuated vaccines. 


For example, nonstructural protein 1 (NS1) of the 
influenza virus plays a crucial role in replication 
and pathogenesis by suppressing ASC 
ubiquitination; thus, vaccination with a live 
attenuated H5N1 influenza vaccine lacking NS1 is 
safe (22-24). The virion host shutoff (vhs) protein 
is the key regulator of the induced early host 
shutoff response, and herpes simplex viruses with 
vhs deleted have been proposed as live attenuated 
vaccines (25, 26). The human metapneumovirus 
(hMPV) protein M2-1 phosphorylation plays 
important regulatory roles in RNA synthesis, 
replication, and pathogenesis, and inhibition of 
M2-1 phosphorylation may serve as a novel 
approach for development of live attenuated 
vaccines (27). 

This phenomenon is also widely found in 
CoVs. Envelope proteins affect virulence, and an 
attenuated virus with Envelope gene deletion is a 
promising vaccine candidate (28-34). Although 
many viral proteins can induce host translational 
shutoff, nsp1 was the first CoV gene 1 protein 
recognized to play an important role in inhibiting 
host gene expression to regulate viral replication. 
In B-CoVs, deletion of the functional region of 
nspl will be critical for developing attenuated 
vaccines (11, 17, 35). However, the role of nsp1 in 
a-CoV pathogenesis is unknown. In our study, we 
replaced the important motif in the TGEV nsp1- 
coding sequence using our reverse genetics 
technology. Then, we successfully recovered the 
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mutant virus. Subsequently, the viral growth and 
peak titers of the mutant virus in susceptible cells 
were found to be indistinguishable from those of 
the wild-type virus, indicating that the nsp1 gene 
was not a critical factor driving viral growth in 
cell culture (Figure 6D and E). To confirm this 
conclusion, we also tested the nspl expression 
level during viral infection and found that it was 
indistinguishable between the mutant and wild- 
type viruses (Figure 6C). Furthermore, the animal 
experiments revealed that the determinant region 
within TGEV nsp1 was critical for viral virulence 
based on gain-of-function studies. Taken together, 
the results of this study demonstrate, for the first 
time, that loss of a-CoV nsp1-induced inhibition 
of host gene expression does not affect viral 
replication but significantly reduces virulence. 
Among the four CoV genera, only a-CoV 
and B-CoV encode nsp1l. Previous studies have 
indicated that the mechanisms by which nsp1 
suppresses host gene expression may differ among 
CoV species (13-15, 36). The mechanisms of 
nspl-induced host gene suppression have been 
well characterized for B-CoVs. Nsp1 proteins of 
bat CoVs have been shown to bind viral RNA and 
to suppress host translation (37). SARS-CoV nsp1 
uses a two-pronged strategy to inhibit host gene 
expression by first interacting with the 40S 
ribosomal subunit and then inactivating translation 
(13). MERS-CoV nsp1 selectively targets 
transcribed mRNA to inhibit host gene expression 
(14). MHV nsp1 interferes efficiently with the 
type I IFN system to enhance virulence (11). 
Nonetheless, structural analysis indicates that 
CoV nspls have a common origin (7-9) and that 
the nspl proteins of a-CoVs and B-CoVs share a 
biological function to inhibit host gene expression. 
Furthermore, previous studies have revealed that 
the C-terminus of MHV and SARS-CoV nsp1 is 
an important functional region for inhibiting host 
protein synthesis (10, 11). However, the 


conserved important domain underlying the a- 
CoV nspl-induced inhibition of host gene 
expression is unclear. Although we have reported 
that the loops at amino acid positions 67—71, 78— 
85, and 103-110 generate a stable functional 
region (9), this functional domain is not present in 
the TGEV nspl protein (12). To identify the 
common functional domain in a-CoV nspl, we 
selected TGEV nspl as a model. Although the 
structure of a truncated TGEV nspl (PDB 
accession number 3ZBD) has been reported (8), 
we wondered whether the C-terminus could affect 
the overall structure. Subsequently, we determined 
the structure of full-length TGEV nsp1. Structural 
characterization revealed that full-length TGEV 
nspl shared a common skeleton with truncated 
nsp1l. These similar conformations suggested that 
the C-terminus had no influence on the overall 
structure of TGEV nsp1. Although 37-40 mutant 
protein significantly reduced the inhibition of host 
gene expression, the mutant virus with nsp1(37— 
40sg) was not succeed. Previous study has also 
reported that 5’-untranslated region of coronavirus 
can effect on viral replication (38), we are not sure 
whether the rescue failure of this mutant virus is 
due to the loss of nspl activity or effect of 5’- 
untranslated region. Through structural analysis 
and biochemical experiments, we verified that the 
motif comprising amino acids 91-95 was 
necessary for the inhibition of host gene 
expression by nsp1. Interestingly, this motif could 
be expanded to other representative a-CoV nsp1 
proteins (Figure 5). In our study, we found that the 
motif comprising amino acids 91-95 is a 
conserved region in a-CoVs, which is conducive 
to understanding the mechanism of a-CoV nsp1 
inhibition of host gene expression. 

In summary, using a reverse genetics 
platform, we obtained the first evidence that 
TGEV nsp1 does not affect replication ability but 
significantly reduces pathogenicity. We also 
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found that the motif comprising amino acids 91— 
95 in a-CoV nspl is a common functional region 
involved in the inhibition of host gene expression. 
Overall, our results add detail to the molecular 
mechanism of a-CoV virulence, which might 
contribute to the design of a novel attenuated a- 
CoV vaccine based on nsp1 modifications. 


Experimental procedures 


Cells and viruses 

HEK-293T and ST cells (ATCC) were 
maintained at 37°C with 5% CO, in Gibco DMEM 
(high glucose, 4.5 g/L) containing 10% fetal 
bovine serum. PK15 cells (ATCC) were 
maintained in Gibco DMEM (low glucose, 1 g/L) 
supplemented with 10% fetal bovine serum. The 
pBAC-TGEV-GFP plasmid containing a full- 
length infectious cDNA clone, which was derived 
from the highly virulent TGEV strain WH-1 
(GenBank accession number HQ462571), has 
been described previously (19). To acquire the 
TGEV recombinant virus, the plasmid was first 
transfected into HEK-293T cells, and then the 
virus was recovered to infect virus-susceptible 
cells. 


Plasmid construction 

For expression in E. coli, the full-length 
gene sequence of TGEV nspl (GenBank 
accession no. HQ462571.1) was cloned into a 
pET-42b (+) vector with an N-terminal His, tag 
via PCR amplification. The forward and reverse 
primers contained the NdeI and Xhol restriction 
sites, respectively. The mutant TGEV nspl 
sequence was inserted into pET-42b using the 
same method. 

For expression in eukaryotic cells, wild- 
type nspl_ flanked with an N-terminal 
hemagglutinin (HA) tag was cloned into the 
PCAGGS vector using the EcoRI and Xhol 


restriction sites. To determine the functional 
region of TGEV nspl, we used a previously 
reported method (9). The following plasmids were 
engineered in which a random coil was replaced 
with a flexible sg linker: nsp1(1—3sg), nsp1(11— 
15sg), nsp1(19—22sg), nsp1(37—-40sg), nsp1(46— 
48sg), nsp1(55—60sg), nsp1(66—69sg), nsp1(76— 
84sg), nsp1(91—95sg) and nsp1(105—109sg). All 
of the recombinant expression plasmids were 


sequenced, and no unexpected mutations occurred. 


To construct the recombinant virus 
containing mutant nspl, a method previously 
reported by our laboratory was used (19). The 
sequences of the relevant primers are shown in 
Table 2. Each corresponding mutant virus was 
efficiently constructed using CRISPR/Cas9 
technology. Briefly, two specific restriction sites 
encompassing the TGEV nspl sequence were 
selected. Then, we synthesized two types of 
single-stranded DNA forward primers 
(sgnsp1F(91—95sg) and sgnspI1R(Q1—95sg)) and a 
constant reverse primer (sgRNA) corresponding to 
the two restriction sites. After performing 
annealing PCR using the forward and reverse 
primers, the purified short DNA fragments from 
the PCR products were transcribed using T7 RNA 
polymerase. The transcribed products 
corresponding to the two sites were incubated 
with the nuclease Cas9 to digest the pTGEV-GFP 
BAC plasmid in vitro, and the digestion yielded a 
linearized BAC and an ~1.8-kb DNA fragment 
that included the TGEV nspl _ sequence. 
Specifically, pTGEV-GFP BAC was digested in a 
50-uL reaction mixture with 5 ug of pTGEV-GFP 
BAC, 5 uL of Cas9 (NEB), 10 ug of sgRNA and 5 
uL of Nuclease Reaction Buffer (NEB) at 37°C 
overnight. 


Protein expression and purification 
For protein expression, the recombinant 
plasmid was transformed into E. coli Trans BL21 
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(DE3) cells, which were grown at 37°C in 
lysogeny broth (LB) medium containing 50 
ug/mL kanamycin until the optical density at 600 
nm (ODeďoo) reached 0.6-0.8. Then, 1 mM 
isopropyl-B-D-thiogalactopyranoside (IPTG) was 
added to induce cell growth for 8 h at 27°C. 
Purification was performed using a previously 
reported method (9). The target protein was 
further purified by a Superdex200 (GE Healthcare) 
column with elution buffer (20 mM Tris-HCl and 
200 mM NaCl, pH 7.4). For crystallization, the 
purified protein was concentrated to 
approximately 10 mg/mL. 


Crystallization, data collection, and structure 
determination 

TGEV nspl and TGEV nsp1(91—95sg) 
were crystallized via the sitting drop vapor 
diffusion method at 20°C. Optimal crystals were 
obtained under the following conditions through 
screening and optimization: 0.5 M nickel chloride 
and 4% PEG3350 for TGEV nspl and 0.2 M 
ammonium phosphate dibasic and 23% PEG3350 
for TGEV nsp1(91—95sg). Single crystals were 
first washed with 5%, 10%, 15% and 30% 
ethylene glycol (v/v) as a cryoprotectant and then 
flash-frozen in liquid nitrogen. All data collection 
steps were performed on the beamline BL17U at 
the Shanghai Synchrotron Radiation Facility 
(SSRF) using a MAR 225 CCD detector (MAR 
Research). All of the diffraction images were 
integrated, merged, and scaled using HKL-3000 
software (39). The structures were solved by 
molecular replacement with PHASER (40) using 
the truncated structure of TGEV nspl (PDB 
identifier 3ZBD) as a starting model. Manual 
model building was performed using Coot (41), 
and the structures were refined with Phenix (42). 
The refinement statistics are shown in Table 1. All 
of the structural figures were drawn using PYMOL 
(43). The root mean square deviation (RMSD) 


PDBeFold 
(http://pdbe.org/fold/). The amino acid sequences 


was analyzed using 


of the CoV nsp1 proteins were aligned using the 
ClustalW2 software program (44) and visualized 
with the ESPript 3 server (http://espript.ibcp.fr). 


Reporter assay, ribopuromycylation assay and 
western blot analysis 

The functional plasmid and reporter gene 
plasmids were transfected into HEK-293T cells 
using Lipo3000 according to the manufacturer’s 
instructions. At 24 h post transfection, cell lysates 
were prepared and subjected to an Rluc reporter 
activity assay (Promega). A ribopuromycylation 
assay was performed as described previously (9, 
45). Briefly, cultured HEK-293T cells were 
transfected with different doses of the different 
plasmids. At various time points post transfection, 
the cells were pulse-labeled with 3 uM puromycin 
and then incubated for an additional hour at 37°C 
with 5% CO3. 

To determine the success of plasmid 
transfection or virus infection in the cells, cell 
samples were collected for treatment. For 
treatment, the cells were first gently washed twice 
with precooled PBS and then centrifuged at 4000 
rpm for 5 min. The supernatant was discarded, 
RIPA lysis buffer (Beyotime) was added, and the 
samples were incubated with rotation at 4°C for 
20 min. The extracts were prepared in SDS-PAGE 
sample buffer. Protein expression was analyzed 
via western blotting. The proteins were visualized 
using an anti-HA antibody (Ab; SAB), an anti- 
puromycin Ab (Millipore), an anti-TGEV N Ab 
(provided by the laboratory), and an anti-TGEV 
nspl Ab (provided by the laboratory). 
Glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH) expression was detected with an anti- 
GAPDH monoclonal Ab (MAb; Proteintech) to 
confirm equal protein loading. The corresponding 
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grayscale value of each protein band was analyzed 
with ImageJ. 


Quantitative analysis 
of mRNA was 
performed as described previously (9). Briefly, 


Quantitative analysis 


RNA was extracted from the tissue or cell samples, 
and a reverse transcription kit (Toyobo) was used 
to quantitatively reverse transcribe 100 ng of RNA 
in a 20 uL system. The reverse transcription 
program was performed according to the 
instructions of the reverse transcription kit 
(Toyobo). The target mRNA and the endogenous 
control RNA (18S rRNA) were amplified with a 
TaqMan One-Step Real-time PCR Master Mix 
Reagent Kit (Bio-Rad). 
manufacturer’s instructions, PCR was performed 
on an ABI PRISM 7000 real-time thermocycler 
(Applied Biosystems, Foster City, CA, USA). The 


amount of target mRNA was normalized to the 


Following the 


amount of endogenous 18S rRNA. 


Protein structure accession numbers and 
statistical analysis 

The coordinates and structural 
characteristics of TGEV nspl and TGEV 


nsp1(91—95sg) were submitted to the Research 
Collaboratory for Structural Bioinformatics 
(RCSB) under PDB accession numbers 6IVC and 
6IVD, respectively. Unpaired Student’s t-test and 
one-way analysis of variance with Bonferroni 
correction (Prism version 6.01; GraphPad) were 
used to analyze differences between groups. * 
indicates a significant difference (P < 0.05), ** 
indicates a highly significant difference (P < 0.01), 
and *** indicates 

difference (P < 0.001). 


an extremely significant 
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Animal experiments 

Thirteen 1-day-old piglets from a TGEV- 
free sow were randomly divided into three groups 
and fed fresh liquid milk diluted in warm water 
every 3 h. All piglets were confirmed to be free of 
TGEV, PEDV, porcine deltacoronavirus (PDCoV) 
and rotavirus (RV) by RT-PCR analysis of piglet 
feces before viral challenge. The piglet weights 
were measured and recorded at the beginning of 
the challenge. The piglet challenge group was 
inoculated orally with chimeric viral particles at a 
dose of 1x10° TCIDs) or mock-inoculated with 
500 uL of DMEM. At 7 days post inoculation, all 
surviving piglets were euthanized to reduce the 
stress of the other piglets. Before necropsy, the 
weight of every piglet was recorded. At necropsy, 
four sections of the intestine (the duodenum, 
jejunum, ileum and large intestine) and the 
stomach were collected, fixed in 10% formalin for 
histopathological examination and stained with 
H&E. 


Ethics statement 

The animal experiments were performed 
according to protocols approved by The Scientific 
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Table 1. Data collection and refinement statistics 


TGEV nsp1 TGEV nsp1(91-95sg) 
Data collection 
Space group P2; P2; 
Cell parameter a, b, c (A) 36.16, 67.01, 90.10 33.44, 59.24, 57.17 
a, B, yC) 90.00, 93.98, 90.00 90.00, 97.54, 90.00 
Wavelength 1.0 0.97918 
Resolution range (A) 34.31-1.85 40.95-1.98 
Completeness (%) 96.9 (96.9) 97.6 (97.6) 
R merge (last shell) 0.056 (0.476) 0.025 (0.403) 
Yo (last shell) 35.12 (4.43) 42.35 (9.42) 
Redundancy (last shell) 4.0 (3.9) 3.9 (3.8) 
Refinement 
Resolution (A) 34.31-1.80 40.95-1.97 
Riad Rares 0.179/0.232 0.191/0.237 
No. of reflections 35,753 15,284 
No. of protein atoms 3,490 1,684 
No. of solvent atoms 430 103 
No. of ions/ligands 0 0 
Mean B factor (A’) 20.94 26.55 
RMSD 
Bond length (A) 0.007 0.007 
Bond angle (°) 0.864 0.784 
Ramachandran plot: 
favored, 98.34, 97.58, 
allowed, outlier (%) 1.66, 0.00 2.42, 0.00 


The highest-resolution values are indicated in parentheses. 
Rmerge=>')|li-<I>//)°li, where Ii is the intensity measurement of reflection h and <I> is the average 


intensity from multiple observations. 
Rwork=))||Fo|-|Fc|| /):|Fo|, where Fo and Fc are the observed and calculated structure factors, respectively. 


Rfree is equivalent to Rwork, but 5% of the measured reflections have been excluded from the refinement 


and set aside for cross-validation. 
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Table 2. Sequences of the primers used for CRISPR/Cas9 


Gene Primer sequence (5' to 3') 


AAAAGCACCGACTCGGTGCCACTTTTTCAAGTTGATAACGGACTAGC 


sgRNA CTTATTTTAACTTGCTATTTCTAGCTCTAAAAC 


sgnsp1F(91— = GATCACTAATACGACTCACTATAGCGTAGCAACCAGGCGTTTAGT 
95sg) TTTAGAGCTAGAAA 


sgnspIRQ1— = GATCACTAATACGACTCACTATAGGTTTTAAGACTGCCTGTTGGTT 
95sg) TTAGAGCTAGAAA 


sgRNA is a common bottom oligo for transcription template DNA. The T7 promoter sequence is 
presented in green, and the guide sequence for the targeting of transcription template DNA is presented in 
red. The blue sequence regions of sgnsp1F(91—95sg)/sgnsp1R(91—95sg) are the regions that overlap with 
the sgRNA sequence. 


17 


6107 ‘67 Ane uo ysan3 Aq /310'0qf mmmy/:dyy woy popeojumoq 


A conserved virulence region within alphacoronavirus nsp1 


Figure and figure legends 
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Figure 1. a-CoV nsp1 inhibits protein synthesis. (A) a-CoV nsp1 significantly inhibited the expression 
of reporter genes in HEK-293T cells. The expression of nsp1 and GAPDH was detected by western blot 
analysis using an anti-HA antibody and an anti-GAPDH antibody, respectively (top). (B) HEK-293T cells 
were cotransfected with pRL-SV40 and wild-type plasmids. At 12, 24, and 48 h post transfection, the 
cells were lysed and subjected to real-time quantitative PCR analysis. TGEV nsp1 significantly inhibited 
the synthesis of Rluc mRNA; SARS nsp1 was used as the positive control. (C) Cells were transfected 
with different doses of the TGEV nsp1 plasmid (0 to 2.0 ug) for 24 h. The cells were pulsed with 3 uM 
puromycin for 1 h at 37°C and then subjected to western blot analysis (left). The grayscale values of the 
protein bands were analyzed by ImageJ (right). (D) Cells were pulsed with 3 uM puromycin for 1 h after 
0, 12, 24, and 36 h of transfection and then subjected to western blot analysis (left). The corresponding 
grayscale values of the protein bands were analyzed by ImageJ (right). The error bars show the standard 
deviation of the results from three independent experiments. *P < 0.05, significant; **P < 0.01, highly 
significant; ***P < 0.001, extremely significant. 
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Figure 2. Structure of full-length TGEV nsp1 and identification of the region that inhibits protein 
synthesis. (A) Crystal structure of TGEV nsp1. The structure of nsp1 is shown as a cartoon; a-helices are 
shown in red, and B-sheets are shown in yellow. (B and C) Based on the structure of TGEV nsp1l, 
plasmids with mutated loop regions were constructed. The recombinant plasmids were as follows: 
nsp1(1—3sg), nsp1(11—15sg), nsp1(19-22sg), nsp1(37-40sg), nsp1(46—-48sg), nsp1(55—60sg), nsp1(66— 
69sg), nsp1(76—84sg), nsp1(91—95sg), and nsp1(105—109sg). HEK-293T cells were transfected with the 
control plasmid (PCAGGS) or nsp1 plasmids. At 24 h, the cells were lysed and subjected to Rluc assays 
and western blot analysis. The error bars show the standard deviation of the results from three 
independent experiments. The asterisks indicate statistical significance calculated using Student’s t-test. 
*P < 0.05; **P < 0.01; ***P < 0.001. 
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Figure 3. The motif comprising amino acids 91-95 is a critical region for TGEV nsp1-induced 
inhibition of host protein synthesis. (A) HEK-293T cells were cotransfected with pRL-SV40 encoding 
the Rluc reporter gene downstream of the SV40 promoter and one of the following plasmids: PCAGGS, 
PCAGGS-TGEV-nsp1-HA, PCAGGS-TGEV-nsp1(37—40sg)-HA, or PCAGGS-TGEV-nsp1(91—95sg)- 
HA, which encode no gene, TGEV nsp1, TGEV nsp1(37—-40sg), and TGEV nsp1(91—95sg), respectively. 
At 12, 24, and 48 h post transfection, the cells were lysed and subjected to real-time quantitative PCR 
analysis. The values of TGEV nsp1, TGEV nsp1(37-40sg), and TGEV nsp1(91—95sg) were normalized 
to those of the untreated empty vector (PCAGGS) control, which were set to 1. (B) HEK-293T Cells were 
transfected with different doses of the TGEV nsp1(37—-40sg) plasmid (O to 2.0 ug) for 24 h. The cells 
were pulsed with 3 uM puromycin for 1 h at 37°C and then subjected to western blot analysis (left). The 
grayscale values of the protein bands were analyzed by ImageJ (right). (C) Cells were transfected with 
different doses of the TGEV nsp1(91—95sg) plasmid (0 to 2.0 ug) for 24 h. The cells were pulsed with 3 
uM puromycin for | h at 37°C and then subjected to western blot analysis (left). The grayscale values of 
the protein bands were analyzed by ImageJ (right). (D) The cells were pulsed with 3 uM puromycin for 1 
h after transfection for 0, 12, 24, and 36 h (left) and then subjected to western blot analysis (right). The 
grayscale values of the protein bands were analyzed by ImageJ. Data represented as mean + standard 
deviation, n = 3, *P < 0.05, **P < 0.01, ***P < 0.001. 
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Figure 4. Structural comparisons of TGEV nsp1 and TGEV nsp1(91-95sg). (A) Views from opposite 
sides of the electrostatic surface of TGEV nsp1. (B) Views from opposite sides of the electrostatic surface 
of TGEV nsp1(91—95sg). Positive charges are shown in blue, and negative charges are shown in red. (C) 
Views from opposite sides of a cartoon model of TGEV nsp1, with the motif (amino acids 91-95) marked 
in black. (D) Views from opposite sides of a cartoon model of TGEV nsp1(91-95sg), with the motif 
(amino acids 91-95) marked in black. TGEV nsp1 is shown in yellow, and TGEV nsp1(91-95sg) is 
shown in red. (E) Ribbon diagrams of the two structures. (F) Root mean square deviations (RMSDs) 
between TGEV nsp1 and TGEV nsp1(91—95sg). The RMSDs were calculated using PDBeFold. 
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Figure 5. The motif comprising amino acids 91—95 is an important region for a-CoV nsp1-induced 
inhibition of gene expression. (A) The following sequences from GenBank were used to create the 
sequence alignment (given as the name, abbreviation and GenBank accession number): transmissible 
gastroenteritis virus (TGEV), HQ462571.1; feline infectious peritonitis virus (FIPV), ADL71484.1; 
porcine epidemic diarrhea virus (PEDV), AJP67455.1; human coronavirus 229E (HCoV-229B), 
CAA49377.1; and human coronavirus NL63 (HCoV-NL63), AFV53147.1. The residue numbers with 
reference to TGEV nsp1 are labeled on top of the panel. The regions comprising amino acids 91—95 are 
marked by small black dots in the a-CoV nsp1 proteins. Residues conserved in most sequences are shown 
in red and are boxed with a white background. The sequences were aligned with ClustalW2, and the 
figure was prepared with ESPript3.0. (B and C) Host protein synthesis inhibited by “sg” replacement 
mutants of a-CoV nsp1. HEK-293T cells were cotransfected with pRL-SV40 or pRL-TK and one of the 
following plasmids: PCAGGS (control), PCAGGS-TGEV-nspl, PCAGGS-TGEV-nsp1(91—95sg), 
PCAGGS-FIPV-nspl, PCAGGS-FIPV-nsp1(91—95sg), PCAGGS-HCoV-229E-nsp1, PCAGGS-HCoV- 
229E-nsp1(92-96sg), PCAGGS-HCoV-NL63-nsp1, PCAGGS-HCoV-NL63-nsp1(92—96sg), PCAGGS- 
PEDV-nsp1l, or PCAGGS-PEDV-nsp1(92—96sg), which encodes no protein, TGEV nspl, TGEV 
nsp1(91—95sg), FIPV nsp1, FIPV nsp1(91—95sg), HCoV-229E nsp1, HCoV-229E nsp1(92—96sg), HCoV- 
NL63 nspl, or HCoV-NL63 nsp1(92—96sg), respectively. At 24 h posttransfection, the cells were lysed 
and subjected to Rluc assays and western blot analysis. Data represented as mean + standard deviation, n 
= 3, *P < 0.05, **P < 0.01, ***P < 0.001. (D) The fold change in Rluc activity in the mutant protein 
relative to the corresponding wild-type protein is shown. 
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Figure 6. Effects of the recombinant virus at the cellular level. (A) DNA-based reverse genetics BAC 
system for TGEV and TGEV(91—95sg). The relative positions of the mutated genes used for molecular 
cloning are indicated. (B and C) The recombinant virus was successfully used to infect susceptible PK15 
and ST cells. TGEV-BAC and TGEV(91-95sg)-BAC were transfected into HEK-293T cells with 
Lipo3000, and then the virus was inoculated into PK15 and ST cells 24 h later. Green fluorescence was 
observed after 36 h. The expression of TGEV N and TGEV nsp1 proteins in cells was also detected by 
western blot analysis using anti-TGEV N and anti-TGEV nsp1 antibodies, respectively. GAPDH served 
as the loading control. (D and E) Multistep growth curves for TGEV and TGEV(91—95sg) in PK15 and 
ST cells at an MOI of 0.1. The virus titers at different time points, as indicated, were determined by an 
endpoint dilution assay. Data represented as mean + standard deviation, n = 3. 
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Figure 7. Pathogenicity analysis of TGEV and TGEV(91-95sg). (A) Piglets were inoculated with the 
viruses. The average weight gain of the piglets was recorded on the day of death or euthanasia as the final 
time point. In the TGEV group, one piglet died by the third day, and the other piglets died by the fourth 
day. In the TGEV(91-95sg) and mock groups, the piglets were euthanized on the eighth day. Data 
represented as mean + standard deviation, n = 3-5. (B) Clinical mental state scores of the piglets in the 
different groups. The following criteria were used for evaluation: 0, normal; 1, mild lethargy (slow to 
move; head down); 2, moderate lethargy (stands but tends to lie down); 3, heavier lethargy (lies down; 
occasionally stands); and 4, severe lethargy (recumbent; moribund). Data represented as mean + standard 
deviation, n = 3-5. (C) Survival rates of the piglets in each group. Survival curves for the piglets infected 
with the recovered viruses in each group are shown. (D) Gross lesions in piglets inoculated with the 
recombinant viruses. The intestinal lesions were examined on the day of death or after euthanasia at the 
final time points. The necropsy images show transparent intestines observed in piglets inoculated with 
TGEV and TGEV(91-95sg) but not in mock-inoculated piglets. (E) Histopathological examination of the 
intestines from the recombinant TGEV-infected piglets. Different segments, including the duodenum, 
jejunum, ileum, large intestine and stomach, were taken from each group and then processed for H&E 
staining. Representative images are shown. 
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Figure 8. Virus shedding in feces and immunohistochemistry of the intestine and stomach. (A) Daily 
virus shedding in feces from the different groups was measured by reverse transcription PCR detection of 
viral genomes in fecal swabs. The length of the target fragment is approximately 1800 bp. (B) 
Immunohistochemistry of different tissue sections, including the stomach, duodenum, jejunum, ileum and 
large intestine, from the piglets. The black arrows point to the virus in the tissue. 
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